Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 14 |
| Number of observations | 5002 |
| Missing cells | 1816 |
| Missing cells (%) | 2.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.2 MiB |
| Average record size in memory | 665.9 B |
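The percentages and sizes in this table can be cross-checked against each other. The sketch below assumes that "Missing cells (%)" is taken over all cells (variables × observations) and that the total size is roughly the average record size times the number of observations; both are assumptions about how the profiler derives the figures, and both agree with the reported values after rounding.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 14
n_observations = 5002
missing_cells = 1816
avg_record_bytes = 665.9

# Assumption: the missing-cell percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total memory is about (average record size) x (number of records).
total_mib = avg_record_bytes * n_observations / 2**20

print(f"total cells:   {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"total size:    {total_mib:.1f} MiB")
```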
Variable types
| Variable type | Count |
|---|---|
| DateTime | 1 |
| Categorical | 5 |
| Numeric | 8 |
| Alert | Type |
|---|---|
| Ruta has a high cardinality: 4119 distinct values | High cardinality |
| OperadOR has a high cardinality: 2267 distinct values | High cardinality |
| route has a high cardinality: 3837 distinct values | High cardinality |
| ac_type has a high cardinality: 2462 distinct values | High cardinality |
| summary has a high cardinality: 4851 distinct values | High cardinality |
| all_aboard is highly overall correlated with PASAJEROS A BORDO and 3 other fields | High correlation |
| PASAJEROS A BORDO is highly overall correlated with all_aboard and 3 other fields | High correlation |
| crew_aboard is highly overall correlated with all_aboard and 3 other fields | High correlation |
| cantidad de fallecidos is highly overall correlated with all_aboard and 4 other fields | High correlation |
| passenger_fatalities is highly overall correlated with all_aboard and 2 other fields | High correlation |
| crew_fatalities is highly overall correlated with crew_aboard and 1 other fields | High correlation |
| route has 759 (15.2%) missing values | Missing |
| PASAJEROS A BORDO has 219 (4.4%) missing values | Missing |
| crew_aboard has 217 (4.3%) missing values | Missing |
| passenger_fatalities has 233 (4.7%) missing values | Missing |
| crew_fatalities has 233 (4.7%) missing values | Missing |
| summary has 59 (1.2%) missing values | Missing |
| ground is highly skewed (γ1 = 48.95716288) | Skewed |
| Ruta is uniformly distributed | Uniform |
| summary is uniformly distributed | Uniform |
| PASAJEROS A BORDO has 866 (17.3%) zeros | Zeros |
| cantidad de fallecidos has 76 (1.5%) zeros | Zeros |
| passenger_fatalities has 1037 (20.7%) zeros | Zeros |
| crew_fatalities has 398 (8.0%) zeros | Zeros |
| ground has 4710 (94.2%) zeros | Zeros |
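The high-cardinality alerts are easier to read next to the "Distinct (%)" figures in the per-variable sections. The sketch below assumes that percentage is computed over non-missing values rather than over all 5002 rows. That assumption reproduces every reported value, and `route`, with 759 missing values, is where the two denominators differ most.

```python
n_observations = 5002

# (distinct, missing) per column, copied from the per-variable sections of this report.
columns = {
    "Ruta": (4119, 5),
    "OperadOR": (2267, 9),
    "route": (3837, 759),
    "ac_type": (2462, 13),
    "summary": (4851, 59),
}

def distinct_pct(distinct, missing, n=n_observations):
    """Distinct values as a percentage of the non-missing values (assumed denominator)."""
    return 100 * distinct / (n - missing)

distinct_pcts = {name: distinct_pct(d, m) for name, (d, m) in columns.items()}

for name, pct in distinct_pcts.items():
    print(f"{name}: {pct:.1f}%")
```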
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2023-05-23 15:31:15.881832 |
| Analysis finished | 2023-05-23 15:31:38.144324 |
| Duration | 22.26 seconds |
| Software version | ydata-profiling v4.1.2 |
| Download configuration | config.json |
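As a quick consistency check, the reported duration follows from the two timestamps. This is a minimal sketch using only the standard library; the format string is an assumption based on how the timestamps are printed here.

```python
from datetime import datetime

# Timestamp layout as printed in the table (assumed format string).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-05-23 15:31:15.881832", FMT)
finished = datetime.strptime("2023-05-23 15:31:38.144324", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```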
fecha
Date
| Distinct | 4571 |
|---|---|
| Distinct (%) | 91.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| Minimum | 1915-03-05 00:00:00 |
|---|---|
| Maximum | 2021-07-06 00:00:00 |
Ruta
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 4119 |
|---|---|
| Distinct (%) | 82.4% |
| Missing | 5 |
| Missing (%) | 0.1% |
| Memory size | 419.8 KiB |
| Moscow, Russia | 16 |
|---|---|
| Manila, Philippines | 15 |
| New York, New York | 14 |
| Cairo, Egypt | 13 |
| Sao Paulo, Brazil | 13 |
| Other values (4114) |
Length
| Max length | 72 |
|---|---|
| Median length | 49 |
| Mean length | 20.808685 |
| Min length | 5 |
Characters and Unicode
| Total characters | 103981 |
|---|---|
| Distinct characters | 90 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
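The category names used in the character tables (Lowercase Letter, Space Separator, Other Punctuation, and so on) match Unicode general categories. As a rough sketch of how such counts can be obtained, the code below classifies characters with Python's `unicodedata` module, using the five sample values listed for `Ruta`. The `CATEGORY_NAMES` mapping is written out here for illustration, and whether the profiler counts characters exactly this way is an assumption.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
    "Cc": "Control",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

# The five sample rows shown for the Ruta column.
samples = [
    "Tienen, Belgium",
    "Off Cuxhaven, Germany",
    "Near Jambol, Bulgeria",
    "Billericay, England",
    "Potters Bar, England",
]

# Count every character by its general category, falling back to the raw code.
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for value in samples
    for ch in value
)

for name, count in category_counts.most_common():
    print(f"{name}: {count}")
```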
Unique
| Unique | 3683 |
|---|---|
| Unique (%) | 73.7% |
Sample
| 1st row | Tienen, Belgium |
|---|---|
| 2nd row | Off Cuxhaven, Germany |
| 3rd row | Near Jambol, Bulgeria |
| 4th row | Billericay, England |
| 5th row | Potters Bar, England |
Common Values
| Value | Count | Frequency (%) |
| Moscow, Russia | 16 | 0.3% |
| Manila, Philippines | 15 | 0.3% |
| New York, New York | 14 | 0.3% |
| Cairo, Egypt | 13 | 0.3% |
| Sao Paulo, Brazil | 13 | 0.3% |
| Bogota, Colombia | 12 | 0.2% |
| Rio de Janeiro, Brazil | 12 | 0.2% |
| Chicago, Illinois | 11 | 0.2% |
| Near Moscow, Russia | 11 | 0.2% |
| Tehran, Iran | 10 | 0.2% |
| Other values (4109) | 4870 |
Words
| Value | Count | Frequency (%) |
| near | 1349 | 9.2% |
| off | 350 | 2.4% |
| russia | 255 | 1.7% |
| new | 228 | 1.6% |
| brazil | 176 | 1.2% |
| colombia | 153 | 1.0% |
| canada | 130 | 0.9% |
| france | 126 | 0.9% |
| california | 117 | 0.8% |
| mexico | 113 | 0.8% |
| Other values (4150) | 11636 |
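The entries in this table are lowercased tokens rather than full values. The sketch below shows one way to produce comparable counts: split on whitespace, strip surrounding punctuation, and lowercase. The profiler's exact tokenization is not shown in this report, so `word_counts` is a hypothetical helper and its rules are an assumption.

```python
import string
from collections import Counter

def word_counts(values):
    """Count lowercased whitespace-separated tokens, with edge punctuation removed."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            token = token.strip(string.punctuation)
            if token:  # skip tokens that were punctuation only
                counts[token] += 1
    return counts

# A few Ruta values that appear in this report.
values = [
    "Near Moscow, Russia",
    "Moscow, Russia",
    "Off Cuxhaven, Germany",
    "Near Jambol, Bulgeria",
]

counts = word_counts(values)
for word, count in counts.most_common(3):
    print(word, count)
```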
Most occurring characters
| Value | Count | Frequency (%) |
| a | 13024 | 12.5% |
| (space) | 9689 | 9.3% |
| e | 7062 | 6.8% |
| i | 6555 | 6.3% |
| n | 6538 | 6.3% |
| r | 6022 | 5.8% |
| o | 5362 | 5.2% |
| , | 5204 | 5.0% |
| l | 3997 | 3.8% |
| s | 3525 | 3.4% |
| Other values (80) | 37003 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 74010 | |
| Uppercase Letter | 14718 | 14.2% |
| Space Separator | 9690 | 9.3% |
| Other Punctuation | 5351 | 5.1% |
| Dash Punctuation | 103 | 0.1% |
| Decimal Number | 66 | 0.1% |
| Control | 21 | < 0.1% |
| Open Punctuation | 11 | < 0.1% |
| Close Punctuation | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13024 | |
| e | 7062 | |
| i | 6555 | |
| n | 6538 | |
| r | 6022 | 8.1% |
| o | 5362 | 7.2% |
| l | 3997 | 5.4% |
| s | 3525 | 4.8% |
| t | 3103 | 4.2% |
| u | 2753 | 3.7% |
| Other values (31) | 16069 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2029 | |
| C | 1453 | 9.9% |
| S | 1144 | 7.8% |
| M | 998 | 6.8% |
| B | 951 | 6.5% |
| A | 919 | 6.2% |
| P | 787 | 5.3% |
| I | 720 | 4.9% |
| R | 652 | 4.4% |
| O | 586 | 4.0% |
| Other values (17) | 4479 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 1 | 15 | |
| 2 | 9 | 13.6% |
| 5 | 8 | 12.1% |
| 8 | 3 | 4.5% |
| 3 | 2 | 3.0% |
| 7 | 2 | 3.0% |
| 9 | 2 | 3.0% |
| 6 | 1 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5204 | |
| . | 115 | 2.1% |
| ' | 24 | 0.4% |
| / | 6 | 0.1% |
| & | 1 | < 0.1% |
| : | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9689 | |
| (no-break space) | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 16 | |
| (control character) | 5 | 23.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 103 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 88728 | |
| Common | 15253 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 13024 | |
| e | 7062 | 8.0% |
| i | 6555 | 7.4% |
| n | 6538 | 7.4% |
| r | 6022 | 6.8% |
| o | 5362 | 6.0% |
| l | 3997 | 4.5% |
| s | 3525 | 4.0% |
| t | 3103 | 3.5% |
| u | 2753 | 3.1% |
| Other values (58) | 30787 |
Common
| Value | Count | Frequency (%) |
| (space) | 9689 | |
| , | 5204 | |
| . | 115 | 0.8% |
| - | 103 | 0.7% |
| 0 | 24 | 0.2% |
| ' | 24 | 0.2% |
| (control character) | 16 | 0.1% |
| 1 | 15 | 0.1% |
| ( | 11 | 0.1% |
| ) | 11 | 0.1% |
| Other values (12) | 41 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 103939 | |
| None | 42 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 13024 | 12.5% |
| (space) | 9689 | 9.3% |
| e | 7062 | 6.8% |
| i | 6555 | 6.3% |
| n | 6538 | 6.3% |
| r | 6022 | 5.8% |
| o | 5362 | 5.2% |
| , | 5204 | 5.0% |
| l | 3997 | 3.8% |
| s | 3525 | 3.4% |
| Other values (63) | 36961 |
None
| Value | Count | Frequency (%) |
| é | 14 | |
| ö | 5 | 11.9% |
| ó | 4 | 9.5% |
| Ã | 4 | 9.5% |
| á | 2 | 4.8% |
| ï | 2 | 4.8% |
| è | 1 | 2.4% |
| ô | 1 | 2.4% |
| Ã | 1 | 2.4% |
| ä | 1 | 2.4% |
| Other values (7) | 7 |
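Stray characters such as `Ã` and `Â` among otherwise ordinary accented letters often indicate text whose UTF-8 bytes were decoded with a single-byte encoding at some point. Whether that happened in this dataset is an assumption; the report alone cannot confirm it. The sketch below shows the usual round-trip repair with a hypothetical helper, `repair_mojibake`, which leaves a string unchanged when the round trip does not apply.

```python
def repair_mojibake(text: str, wrong_encoding: str = "latin-1") -> str:
    """Re-encode with the suspected wrong encoding and decode as UTF-8.

    Returns the input unchanged if either step fails, which is the case
    for text that was decoded correctly in the first place.
    """
    try:
        return text.encode(wrong_encoding).decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text

print(repair_mojibake("BogotÃ¡"))        # garbled input, repaired
print(repair_mojibake("São Paulo"))      # already correct, left as is
print(repair_mojibake("ÃŽ", "cp1252"))   # Windows-1252 variant of the same problem
```

A repair like this is best applied to the source column before profiling, and only after spot-checking a sample of affected rows, since the round trip can also succeed on text that was never garbled.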
OperadOR
Categorical
| Distinct | 2267 |
|---|---|
| Distinct (%) | 45.4% |
| Missing | 9 |
| Missing (%) | 0.2% |
| Memory size | 412.1 KiB |
| Aeroflot | 253 |
|---|---|
| Military - U.S. Air Force | 141 |
| Air France | 74 |
| Deutsche Lufthansa | 63 |
| United Air Lines | 44 |
| Other values (2262) |
Length
| Max length | 65 |
|---|---|
| Median length | 47 |
| Mean length | 18.958342 |
| Min length | 3 |
Characters and Unicode
| Total characters | 94659 |
|---|---|
| Distinct characters | 87 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 1734 |
|---|---|
| Unique (%) | 34.7% |
Sample
| 1st row | Military - German Navy |
|---|---|
| 2nd row | Military - German Navy |
| 3rd row | Military - German Army |
| 4th row | Military - German Navy |
| 5th row | Military - German Navy |
Common Values
| Value | Count | Frequency (%) |
| Aeroflot | 253 | 5.1% |
| Military - U.S. Air Force | 141 | 2.8% |
| Air France | 74 | 1.5% |
| Deutsche Lufthansa | 63 | 1.3% |
| United Air Lines | 44 | 0.9% |
| Military - U.S. Army Air Forces | 43 | 0.9% |
| China National Aviation Corporation | 43 | 0.9% |
| Pan American World Airways | 41 | 0.8% |
| American Airlines | 37 | 0.7% |
| US Aerial Mail Service | 35 | 0.7% |
| Other values (2257) | 4219 |
Words
| Value | Count | Frequency (%) |
| air | 1481 | 10.3% |
| 957 | 6.7% | |
| airlines | 840 | 5.8% |
| military | 774 | 5.4% |
| force | 557 | 3.9% |
| airways | 453 | 3.2% |
| u.s | 300 | 2.1% |
| aeroflot | 265 | 1.8% |
| lines | 184 | 1.3% |
| royal | 152 | 1.1% |
| Other values (2079) | 8415 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 10203 | 10.8% |
| (space) | 9409 | 9.9% |
| r | 8841 | 9.3% |
| a | 7776 | 8.2% |
| e | 6777 | 7.2% |
| n | 5526 | 5.8% |
| A | 5082 | 5.4% |
| o | 4380 | 4.6% |
| l | 4075 | 4.3% |
| s | 4000 | 4.2% |
| Other values (77) | 28590 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 68125 | |
| Uppercase Letter | 15056 | 15.9% |
| Space Separator | 9410 | 9.9% |
| Dash Punctuation | 935 | 1.0% |
| Other Punctuation | 865 | 0.9% |
| Open Punctuation | 115 | 0.1% |
| Close Punctuation | 115 | 0.1% |
| Decimal Number | 30 | < 0.1% |
| Control | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10203 | |
| r | 8841 | |
| a | 7776 | |
| e | 6777 | |
| n | 5526 | |
| o | 4380 | |
| l | 4075 | 6.0% |
| s | 4000 | 5.9% |
| t | 3916 | 5.7% |
| c | 1996 | 2.9% |
| Other values (28) | 10635 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 5082 | |
| M | 1213 | 8.1% |
| S | 1136 | 7.5% |
| C | 910 | 6.0% |
| F | 901 | 6.0% |
| T | 679 | 4.5% |
| L | 661 | 4.4% |
| U | 532 | 3.5% |
| P | 512 | 3.4% |
| N | 493 | 3.3% |
| Other values (16) | 2937 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 7 | 4 | |
| 4 | 4 | |
| 2 | 3 | |
| 5 | 3 | |
| 1 | 3 | |
| 8 | 2 | 6.7% |
| 6 | 2 | 6.7% |
| 9 | 2 | 6.7% |
| 3 | 2 | 6.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 714 | |
| / | 109 | 12.6% |
| ' | 25 | 2.9% |
| , | 10 | 1.2% |
| & | 6 | 0.7% |
| ? | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9409 | |
| (no-break space) | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 6 | |
| (control character) | 2 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 935 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 115 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 115 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83181 | |
| Common | 11478 | 12.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 10203 | |
| r | 8841 | 10.6% |
| a | 7776 | 9.3% |
| e | 6777 | 8.1% |
| n | 5526 | 6.6% |
| A | 5082 | 6.1% |
| o | 4380 | 5.3% |
| l | 4075 | 4.9% |
| s | 4000 | 4.8% |
| t | 3916 | 4.7% |
| Other values (54) | 22605 |
Common
| Value | Count | Frequency (%) |
| (space) | 9409 | |
| - | 935 | 8.1% |
| . | 714 | 6.2% |
| ( | 115 | 1.0% |
| ) | 115 | 1.0% |
| / | 109 | 0.9% |
| ' | 25 | 0.2% |
| , | 10 | 0.1% |
| (control character) | 6 | 0.1% |
| & | 6 | 0.1% |
| Other values (13) | 34 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 94536 | |
| None | 123 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 10203 | 10.8% |
| (space) | 9409 | 10.0% |
| r | 8841 | 9.4% |
| a | 7776 | 8.2% |
| e | 6777 | 7.2% |
| n | 5526 | 5.8% |
| A | 5082 | 5.4% |
| o | 4380 | 4.6% |
| l | 4075 | 4.3% |
| s | 4000 | 4.2% |
| Other values (64) | 28467 |
None
| Value | Count | Frequency (%) |
| é | 102 | |
| á | 5 | 4.1% |
| Ã | 2 | 1.6% |
| Ã | 2 | 1.6% |
| ó | 2 | 1.6% |
| ç | 2 | 1.6% |
| ï | 2 | 1.6% |
| ã | 1 | 0.8% |
| ú | 1 | 0.8% |
| ê | 1 | 0.8% |
| Other values (3) | 3 | 2.4% |
route
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 3837 |
|---|---|
| Distinct (%) | 90.4% |
| Missing | 759 |
| Missing (%) | 15.2% |
| Memory size | 393.6 KiB |
| Training | 96 |
|---|---|
| Sightseeing | 31 |
| Test flight | 22 |
| Sao Paulo - Rio de Janeiro | 7 |
| Test | 6 |
| Other values (3832) |
Length
| Max length | 59 |
|---|---|
| Median length | 51 |
| Mean length | 22.174169 |
| Min length | 4 |
Characters and Unicode
| Total characters | 94085 |
|---|---|
| Distinct characters | 92 |
| Distinct categories | 10 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 3630 |
|---|---|
| Unique (%) | 85.6% |
Sample
| 1st row | Shuttle |
|---|---|
| 2nd row | Venice Taliedo |
| 3rd row | Paris - Hounslow |
| 4th row | Washington - Newark |
| 5th row | London - Paris |
Common Values
| Value | Count | Frequency (%) |
| Training | 96 | 1.9% |
| Sightseeing | 31 | 0.6% |
| Test flight | 22 | 0.4% |
| Sao Paulo - Rio de Janeiro | 7 | 0.1% |
| Test | 6 | 0.1% |
| Rio de Janeiro - Sao Paulo | 5 | 0.1% |
| Villavicencio - Mitu | 4 | 0.1% |
| Bogota - Barranquilla | 4 | 0.1% |
| Barranquilla - Bogota | 4 | 0.1% |
| Croydon - Paris | 4 | 0.1% |
| Other values (3827) | 4060 | |
| (Missing) | 759 | 15.2% |
Words
| Value | Count | Frequency (%) |
| 4633 | ||
| city | 213 | 1.3% |
| new | 149 | 0.9% |
| san | 140 | 0.8% |
| york | 117 | 0.7% |
| paris | 116 | 0.7% |
| training | 103 | 0.6% |
| de | 101 | 0.6% |
| london | 88 | 0.5% |
| moscow | 84 | 0.5% |
| Other values (3628) | 11078 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 12644 | 13.4% |
| a | 9832 | 10.5% |
| n | 5567 | 5.9% |
| o | 5501 | 5.8% |
| i | 5241 | 5.6% |
| e | 5106 | 5.4% |
| - | 4927 | 5.2% |
| r | 4485 | 4.8% |
| l | 3419 | 3.6% |
| s | 3072 | 3.3% |
| Other values (82) | 34291 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 62450 | |
| Uppercase Letter | 12939 | 13.8% |
| Space Separator | 12645 | 13.4% |
| Dash Punctuation | 4931 | 5.2% |
| Other Punctuation | 1065 | 1.1% |
| Control | 30 | < 0.1% |
| Decimal Number | 16 | < 0.1% |
| Final Punctuation | 4 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9832 | |
| n | 5567 | 8.9% |
| o | 5501 | 8.8% |
| i | 5241 | 8.4% |
| e | 5106 | 8.2% |
| r | 4485 | 7.2% |
| l | 3419 | 5.5% |
| s | 3072 | 4.9% |
| t | 3002 | 4.8% |
| u | 2566 | 4.1% |
| Other values (30) | 14659 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1232 | 9.5% |
| B | 1140 | 8.8% |
| S | 1081 | 8.4% |
| A | 1045 | 8.1% |
| M | 1042 | 8.1% |
| P | 823 | 6.4% |
| L | 788 | 6.1% |
| T | 709 | 5.5% |
| K | 640 | 4.9% |
| N | 631 | 4.9% |
| Other values (18) | 3808 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 4 | 3 | |
| 1 | 3 | |
| 7 | 2 | |
| 2 | 2 | |
| 8 | 1 | 6.2% |
| 6 | 1 | 6.2% |
| 0 | 1 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 915 | |
| . | 98 | 9.2% |
| / | 20 | 1.9% |
| ' | 20 | 1.9% |
| ? | 6 | 0.6% |
| : | 5 | 0.5% |
| \ | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 12644 | |
| (no-break space) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4927 | |
| – | 4 | 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 29 | |
| (control character) | 1 | 3.3% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 75389 | |
| Common | 18696 | 19.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9832 | 13.0% |
| n | 5567 | 7.4% |
| o | 5501 | 7.3% |
| i | 5241 | 7.0% |
| e | 5106 | 6.8% |
| r | 4485 | 5.9% |
| l | 3419 | 4.5% |
| s | 3072 | 4.1% |
| t | 3002 | 4.0% |
| u | 2566 | 3.4% |
| Other values (58) | 27598 |
Common
| Value | Count | Frequency (%) |
| (space) | 12644 | |
| - | 4927 | 26.4% |
| , | 915 | 4.9% |
| . | 98 | 0.5% |
| (control character) | 29 | 0.2% |
| / | 20 | 0.1% |
| ' | 20 | 0.1% |
| ? | 6 | < 0.1% |
| : | 5 | < 0.1% |
| ’ | 4 | < 0.1% |
| Other values (14) | 28 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 93956 | |
| None | 121 | 0.1% |
| Punctuation | 8 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 12644 | 13.5% |
| a | 9832 | 10.5% |
| n | 5567 | 5.9% |
| o | 5501 | 5.9% |
| i | 5241 | 5.6% |
| e | 5106 | 5.4% |
| - | 4927 | 5.2% |
| r | 4485 | 4.8% |
| l | 3419 | 3.6% |
| s | 3072 | 3.3% |
| Other values (63) | 34162 |
None
| Value | Count | Frequency (%) |
| é | 38 | |
| Ã | 21 | |
| á | 15 | 12.4% |
| ó | 14 | 11.6% |
| ü | 6 | 5.0% |
| ã | 6 | 5.0% |
| ç | 4 | 3.3% |
| è | 4 | 3.3% |
| ÃŽ | 3 | 2.5% |
| ö | 2 | 1.7% |
| Other values (7) | 8 | 6.6% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 4 | |
| – | 4 |
ac_type
Categorical
| Distinct | 2462 |
|---|---|
| Distinct (%) | 49.3% |
| Missing | 13 |
| Missing (%) | 0.3% |
| Memory size | 407.9 KiB |
| Douglas DC-3 | 333 |
|---|---|
| de Havilland Canada DHC-6 Twin Otter 300 | 81 |
| Douglas C-47A | 70 |
| Douglas C-47 | 64 |
| Douglas DC-4 | 41 |
| Other values (2457) |
Length
| Max length | 42 |
|---|---|
| Median length | 36 |
| Mean length | 18.543997 |
| Min length | 4 |
Characters and Unicode
| Total characters | 92516 |
|---|---|
| Distinct characters | 77 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 1857 |
|---|---|
| Unique (%) | 37.2% |
Sample
| 1st row | Zeppelin L-8 (airship) |
|---|---|
| 2nd row | Zeppelin L-10 (airship) |
| 3rd row | Schutte-Lanz S-L-10 (airship) |
| 4th row | Zeppelin L-32 (airship) |
| 5th row | Zeppelin L-31 (airship) |
Common Values
| Value | Count | Frequency (%) |
| Douglas DC-3 | 333 | 6.7% |
| de Havilland Canada DHC-6 Twin Otter 300 | 81 | 1.6% |
| Douglas C-47A | 70 | 1.4% |
| Douglas C-47 | 64 | 1.3% |
| Douglas DC-4 | 41 | 0.8% |
| Antonov AN-26 | 35 | 0.7% |
| Yakovlev YAK-40 | 35 | 0.7% |
| Junkers JU-52/3m | 30 | 0.6% |
| De Havilland DH-4 | 27 | 0.5% |
| Douglas DC-6B | 27 | 0.5% |
| Other values (2452) | 4246 |
Words
| Value | Count | Frequency (%) |
| douglas | 1130 | 8.3% |
| boeing | 418 | 3.1% |
| dc-3 | 387 | 2.8% |
| lockheed | 332 | 2.4% |
| de | 294 | 2.2% |
| havilland | 292 | 2.1% |
| antonov | 288 | 2.1% |
| canada | 159 | 1.2% |
| otter | 146 | 1.1% |
| ilyushin | 142 | 1.0% |
| Other values (2522) | 10011 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 8641 | 9.3% |
| - | 5178 | 5.6% |
| e | 4833 | 5.2% |
| o | 4638 | 5.0% |
| a | 4631 | 5.0% |
| n | 3852 | 4.2% |
| l | 3690 | 4.0% |
| i | 3474 | 3.8% |
| r | 3299 | 3.6% |
| C | 3033 | 3.3% |
| Other values (67) | 47247 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46357 | |
| Uppercase Letter | 17887 | 19.3% |
| Decimal Number | 13806 | 14.9% |
| Space Separator | 8642 | 9.3% |
| Dash Punctuation | 5178 | 5.6% |
| Other Punctuation | 264 | 0.3% |
| Open Punctuation | 188 | 0.2% |
| Close Punctuation | 187 | 0.2% |
| Math Symbol | 3 | < 0.1% |
| Control | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4833 | |
| o | 4638 | |
| a | 4631 | |
| n | 3852 | 8.3% |
| l | 3690 | 8.0% |
| i | 3474 | 7.5% |
| r | 3299 | 7.1% |
| s | 2912 | 6.3% |
| t | 2354 | 5.1% |
| u | 2216 | 4.8% |
| Other values (18) | 10458 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3033 | |
| D | 2818 | |
| A | 1901 | |
| B | 1727 | |
| H | 1016 | 5.7% |
| L | 881 | 4.9% |
| F | 795 | 4.4% |
| S | 790 | 4.4% |
| I | 639 | 3.6% |
| T | 620 | 3.5% |
| Other values (16) | 3667 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2166 | |
| 0 | 2103 | |
| 1 | 2016 | |
| 3 | 1706 | |
| 4 | 1704 | |
| 7 | 1494 | |
| 6 | 875 | |
| 5 | 713 | 5.2% |
| 8 | 664 | 4.8% |
| 9 | 365 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 185 | |
| . | 76 | |
| , | 2 | 0.8% |
| & | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8641 | |
| (no-break space) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5178 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 188 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 187 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Control
| Value | Count | Frequency (%) |
| (control character) | 2 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64244 | |
| Common | 28272 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4833 | 7.5% |
| o | 4638 | 7.2% |
| a | 4631 | 7.2% |
| n | 3852 | 6.0% |
| l | 3690 | 5.7% |
| i | 3474 | 5.4% |
| r | 3299 | 5.1% |
| C | 3033 | 4.7% |
| s | 2912 | 4.5% |
| D | 2818 | 4.4% |
| Other values (44) | 27064 |
Common
| Value | Count | Frequency (%) |
| (space) | 8641 | |
| - | 5178 | |
| 2 | 2166 | 7.7% |
| 0 | 2103 | 7.4% |
| 1 | 2016 | 7.1% |
| 3 | 1706 | 6.0% |
| 4 | 1704 | 6.0% |
| 7 | 1494 | 5.3% |
| 6 | 875 | 3.1% |
| 5 | 713 | 2.5% |
| Other values (13) | 1676 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92497 | |
| None | 17 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 8641 | 9.3% |
| - | 5178 | 5.6% |
| e | 4833 | 5.2% |
| o | 4638 | 5.0% |
| a | 4631 | 5.0% |
| n | 3852 | 4.2% |
| l | 3690 | 4.0% |
| i | 3474 | 3.8% |
| r | 3299 | 3.6% |
| C | 3033 | 3.3% |
| Other values (62) | 47228 |
None
| Value | Count | Frequency (%) |
| é | 12 | |
| è | 4 | 23.5% |
| (no-break space) | 1 | 5.9% |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 | |
| ’ | 1 |
all_aboard
Real number (ℝ)
| Distinct | 244 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 17 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.147242 |
| Minimum | 0 |
|---|---|
| Maximum | 644 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 7 |
| median | 16 |
| Q3 | 35 |
| 95-th percentile | 117.8 |
| Maximum | 644 |
| Range | 644 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 45.499656 |
|---|---|
| Coefficient of variation (CV) | 1.4607925 |
| Kurtosis | 23.92979 |
| Mean | 31.147242 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 3.9190237 |
| Sum | 155269 |
| Variance | 2070.2187 |
| Monotonicity | Not monotonic |
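Several of the descriptive statistics for `all_aboard` are derived from one another, which gives a quick way to sanity-check the table. The sketch below recomputes the coefficient of variation, the variance, and the interquartile range from the reported mean, standard deviation, and quartiles, using the standard definitions.

```python
# Values copied from the all_aboard tables.
mean = 31.147242
std = 45.499656
q1, q3 = 7, 35

cv = std / mean       # coefficient of variation
variance = std ** 2   # variance is the squared standard deviation
iqr = q3 - q1         # interquartile range

print(f"CV:       {cv:.7f}")
print(f"Variance: {variance:.4f}")
print(f"IQR:      {iqr}")
```

A coefficient of variation above 1 means the spread exceeds the mean, which fits the strong right skew reported for this variable.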
| Value | Count | Frequency (%) |
| 3 | 280 | 5.6% |
| 2 | 245 | 4.9% |
| 4 | 202 | 4.0% |
| 5 | 189 | 3.8% |
| 10 | 179 | 3.6% |
| 6 | 174 | 3.5% |
| 7 | 164 | 3.3% |
| 1 | 137 | 2.7% |
| 9 | 130 | 2.6% |
| 11 | 128 | 2.6% |
| Other values (234) | 3157 |
| Value | Count | Frequency (%) |
| 0 | 5 | 0.1% |
| 1 | 137 | |
| 2 | 245 | |
| 3 | 280 | |
| 4 | 202 | |
| 5 | 189 | |
| 6 | 174 | |
| 7 | 164 | |
| 8 | 119 | |
| 9 | 130 |
| Value | Count | Frequency (%) |
| 644 | 1 | |
| 524 | 1 | |
| 517 | 1 | |
| 394 | 1 | |
| 393 | 1 | |
| 384 | 1 | |
| 356 | 1 | |
| 349 | 1 | |
| 346 | 1 | |
| 340 | 1 |
PASAJEROS A BORDO
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 234 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 219 |
| Missing (%) | 4.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.899645 |
| Minimum | 0 |
|---|---|
| Maximum | 614 |
| Zeros | 866 |
| Zeros (%) | 17.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 12 |
| Q3 | 30 |
| 95-th percentile | 111.9 |
| Maximum | 614 |
| Range | 614 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 44.047016 |
|---|---|
| Coefficient of variation (CV) | 1.6374572 |
| Kurtosis | 24.1706 |
| Mean | 26.899645 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 3.93525 |
| Sum | 128661 |
| Variance | 1940.1397 |
| Monotonicity | Not monotonic |
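The reported means are consistent with dividing each sum by the number of non-missing observations, not by all 5002 rows. The sketch below checks this for the two variables profiled so far; that missing values are excluded from the denominator is an assumption, and it matches the reported figures.

```python
n_observations = 5002

# (sum, missing, reported mean), copied from the tables in this report.
variables = {
    "all_aboard": (155269, 17, 31.147242),
    "PASAJEROS A BORDO": (128661, 219, 26.899645),
}

# Assumption: the mean is taken over non-missing rows only.
recomputed = {
    name: total / (n_observations - missing)
    for name, (total, missing, _reported) in variables.items()
}

for name, (_total, _missing, reported) in variables.items():
    print(f"{name}: recomputed {recomputed[name]:.6f}, reported {reported}")
```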
| Value | Count | Frequency (%) |
| 0 | 866 | 17.3% |
| 4 | 170 | 3.4% |
| 2 | 162 | 3.2% |
| 5 | 140 | 2.8% |
| 3 | 130 | 2.6% |
| 7 | 130 | 2.6% |
| 9 | 128 | 2.6% |
| 10 | 128 | 2.6% |
| 8 | 126 | 2.5% |
| 1 | 119 | 2.4% |
| Other values (224) | 2684 | |
| (Missing) | 219 | 4.4% |
| Value | Count | Frequency (%) |
| 0 | 866 | |
| 1 | 119 | 2.4% |
| 2 | 162 | 3.2% |
| 3 | 130 | 2.6% |
| 4 | 170 | 3.4% |
| 5 | 140 | 2.8% |
| 6 | 109 | 2.2% |
| 7 | 130 | 2.6% |
| 8 | 126 | 2.5% |
| 9 | 128 | 2.6% |
| Value | Count | Frequency (%) |
| 614 | 1 | |
| 509 | 1 | |
| 503 | 1 | |
| 381 | 1 | |
| 374 | 1 | |
| 364 | 1 | |
| 338 | 1 | |
| 335 | 1 | |
| 327 | 1 | |
| 316 | 1 |
crew_aboard
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 217 |
| Missing (%) | 4.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5216301 |
| Minimum | 0 |
|---|---|
| Maximum | 83 |
| Zeros | 7 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 83 |
| Range | 83 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.7586025 |
|---|---|
| Coefficient of variation (CV) | 0.83124944 |
| Kurtosis | 62.87587 |
| Mean | 4.5216301 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 4.96181 |
| Sum | 21636 |
| Variance | 14.127093 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 954 | |
| 2 | 828 | |
| 4 | 694 | |
| 1 | 532 | |
| 5 | 513 | |
| 6 | 375 | 7.5% |
| 7 | 244 | 4.9% |
| 8 | 173 | 3.5% |
| 9 | 115 | 2.3% |
| 10 | 94 | 1.9% |
| Other values (24) | 263 | 5.3% |
| (Missing) | 217 | 4.3% |
| Value | Count | Frequency (%) |
| 0 | 7 | 0.1% |
| 1 | 532 | |
| 2 | 828 | |
| 3 | 954 | |
| 4 | 694 | |
| 5 | 513 | |
| 6 | 375 | 7.5% |
| 7 | 244 | 4.9% |
| 8 | 173 | 3.5% |
| 9 | 115 | 2.3% |
| Value | Count | Frequency (%) |
| 83 | 1 | < 0.1% |
| 61 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 4 |
cantidad de fallecidos
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 199 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 8 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.310773 |
| Minimum | 0 |
|---|---|
| Maximum | 583 |
| Zeros | 76 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 11 |
| Q3 | 25 |
| 95-th percentile | 85 |
| Maximum | 583 |
| Range | 583 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 35.01637 |
|---|---|
| Coefficient of variation (CV) | 1.5694826 |
| Kurtosis | 36.822466 |
| Mean | 22.310773 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 4.6201663 |
| Sum | 111420 |
| Variance | 1226.1461 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 381 | 7.6% |
| 2 | 377 | 7.5% |
| 3 | 363 | 7.3% |
| 4 | 242 | 4.8% |
| 5 | 234 | 4.7% |
| 6 | 176 | 3.5% |
| 7 | 160 | 3.2% |
| 10 | 159 | 3.2% |
| 13 | 132 | 2.6% |
| 9 | 128 | 2.6% |
| Other values (189) | 2642 |
| Value | Count | Frequency (%) |
| 0 | 76 | 1.5% |
| 1 | 381 | |
| 2 | 377 | |
| 3 | 363 | |
| 4 | 242 | |
| 5 | 234 | |
| 6 | 176 | |
| 7 | 160 | |
| 8 | 128 | 2.6% |
| 9 | 128 | 2.6% |
| Value | Count | Frequency (%) |
| 583 | 1 | |
| 520 | 1 | |
| 349 | 1 | |
| 346 | 1 | |
| 329 | 1 | |
| 301 | 1 | |
| 298 | 1 | |
| 290 | 1 | |
| 275 | 1 | |
| 271 | 1 |
passenger_fatalities
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 190 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 233 |
| Missing (%) | 4.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.956385 |
| Minimum | 0 |
|---|---|
| Maximum | 560 |
| Zeros | 1037 |
| Zeros (%) | 20.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 8 |
| Q3 | 21 |
| 95-th percentile | 81 |
| Maximum | 560 |
| Range | 560 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 34.07517 |
|---|---|
| Coefficient of variation (CV) | 1.7975564 |
| Kurtosis | 36.929095 |
| Mean | 18.956385 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 4.6447956 |
| Sum | 90403 |
| Variance | 1161.1172 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1037 | |
| 1 | 307 | 6.1% |
| 2 | 263 | 5.3% |
| 3 | 193 | 3.9% |
| 4 | 185 | 3.7% |
| 5 | 139 | 2.8% |
| 6 | 133 | 2.7% |
| 7 | 126 | 2.5% |
| 8 | 126 | 2.5% |
| 9 | 118 | 2.4% |
| Other values (180) | 2142 | |
| (Missing) | 233 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 1037 | |
| 1 | 307 | 6.1% |
| 2 | 263 | 5.3% |
| 3 | 193 | 3.9% |
| 4 | 185 | 3.7% |
| 5 | 139 | 2.8% |
| 6 | 133 | 2.7% |
| 7 | 126 | 2.5% |
| 8 | 126 | 2.5% |
| 9 | 118 | 2.4% |
| Value | Count | Frequency (%) |
| 560 | 1 | |
| 505 | 1 | |
| 335 | 1 | |
| 316 | 1 | |
| 307 | 1 | |
| 287 | 1 | |
| 283 | 1 | |
| 278 | 1 | |
| 258 | 1 | |
| 257 | 1 |
crew_fatalities
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 233 |
| Missing (%) | 4.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5890124 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 398 |
| Zeros (%) | 8.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 9 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.1775106 |
|---|---|
| Coefficient of variation (CV) | 0.88534402 |
| Kurtosis | 12.868381 |
| Mean | 3.5890124 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.4992005 |
| Sum | 17116 |
| Variance | 10.096574 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 892 | |
| 3 | 824 | |
| 1 | 770 | |
| 4 | 591 | |
| 5 | 401 | |
| 0 | 398 | |
| 6 | 273 | 5.5% |
| 7 | 171 | 3.4% |
| 8 | 130 | 2.6% |
| 9 | 87 | 1.7% |
| Other values (18) | 232 | 4.6% |
| (Missing) | 233 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 398 | |
| 1 | 770 | |
| 2 | 892 | |
| 3 | 824 | |
| 4 | 591 | |
| 5 | 401 | |
| 6 | 273 | 5.5% |
| 7 | 171 | 3.4% |
| 8 | 130 | 2.6% |
| 9 | 87 | 1.7% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 2 | < 0.1% |
| 23 | 6 | |
| 22 | 5 | |
| 21 | 2 | < 0.1% |
| 20 | 3 | |
| 19 | 5 | |
| 18 | 3 |
ground
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 51 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 44 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7204518 |
| Minimum | 0 |
|---|---|
| Maximum | 2750 |
| Zeros | 4710 |
| Zeros (%) | 94.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.15 |
| Maximum | 2750 |
| Range | 2750 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 55.529088 |
|---|---|
| Coefficient of variation (CV) | 32.275876 |
| Kurtosis | 2420.877 |
| Mean | 1.7204518 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 48.957163 |
| Sum | 8530 |
| Variance | 3083.4796 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4710 | |
| 1 | 63 | 1.3% |
| 2 | 34 | 0.7% |
| 3 | 21 | 0.4% |
| 4 | 16 | 0.3% |
| 5 | 12 | 0.2% |
| 7 | 10 | 0.2% |
| 8 | 9 | 0.2% |
| 10 | 6 | 0.1% |
| 6 | 6 | 0.1% |
| Other values (41) | 71 | 1.4% |
| (Missing) | 44 | 0.9% |
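
The Frequency (%) column appears to be each count divided by the 5002 observations, shown with one decimal place. A minimal sketch of that arithmetic follows; `format_share` is a hypothetical helper, and the rule that very small shares print as "< 0.1%" is an assumption inferred from the tables, not a documented behaviour of the profiler.

```python
def format_share(count, total):
    """Format count/total as a percentage with one decimal place.

    Hypothetical helper: positive shares that round to zero at three
    decimal places are shown as "< 0.1%" (an assumption based on how
    small frequencies appear in the tables of this report).
    """
    share = count / total
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"
    return f"{share * 100:.1f}%"


n_rows = 5002  # number of observations in the dataset

# A few (value, count) pairs taken from the `ground` tables.
for value, count in [(0, 4710), (1, 63), (9, 1)]:
    print(value, count, format_share(count, n_rows))
```
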
| Value | Count | Frequency (%) |
| 0 | 4710 | 94.2% |
| 1 | 63 | 1.3% |
| 2 | 34 | 0.7% |
| 3 | 21 | 0.4% |
| 4 | 16 | 0.3% |
| 5 | 12 | 0.2% |
| 6 | 6 | 0.1% |
| 7 | 10 | 0.2% |
| 8 | 9 | 0.2% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2750 | 2 | < 0.1% |
| 225 | 1 | < 0.1% |
| 125 | 2 | < 0.1% |
| 113 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 78 | 1 | < 0.1% |
| 71 | 1 | < 0.1% |
| 63 | 1 | < 0.1% |
| 58 | 1 | < 0.1% |
summary
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 4851 |
|---|---|
| Distinct (%) | 98.1% |
| Missing | 59 |
| Missing (%) | 1.2% |
| Memory size | 1.4 MiB |
| Crashed under unknown circumstances. | 9 |
|---|---|
| Crashed while en route. | 8 |
| Crashed while attempting to land. | 7 |
| Crashed during takeoff. | 6 |
| Crashed into the sea. | 5 |
| Other values (4846) | 4908 |
Length
| Max length | 2669 |
|---|---|
| Median length | 787 |
| Mean length | 223.39794 |
| Min length | 8 |
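
The length statistics summarise the number of characters in each `summary` string. As a rough illustration (not the profiler's own code), the same four figures can be computed for a handful of strings; the three strings below are taken from the common values of `summary`, and the variable names are hypothetical.

```python
from statistics import mean, median

# Three of the most common `summary` values listed in this report.
summaries = [
    "Crashed while en route.",
    "Crashed into the sea.",
    "Crashed on takeoff.",
]

# Length here means the number of characters (Unicode code points).
lengths = [len(text) for text in summaries]

stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(lengths)
print(stats)
```
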
Characters and Unicode
| Total characters | 1104256 |
|---|---|
| Distinct characters | 101 |
| Distinct categories | 14 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 4807 |
|---|---|
| Unique (%) | 97.2% |
Sample
| 1st row | Crashed into trees while attempting to land after being shot down by British and French aircraft. |
|---|---|
| 2nd row | Exploded and burned near Neuwerk Island, when hydrogen gas, being vented, was ignited by lightning. |
| 3rd row | Crashed near the Black Sea, cause unknown. |
| 4th row | Shot down by British aircraft crashing in flames. |
| 5th row | Shot down in flames by the British 39th Home Defence Squadron. |
Common Values
| Value | Count | Frequency (%) |
| Crashed under unknown circumstances. | 9 | 0.2% |
| Crashed while en route. | 8 | 0.2% |
| Crashed while attempting to land. | 7 | 0.1% |
| Crashed during takeoff. | 6 | 0.1% |
| Crashed into the sea. | 5 | 0.1% |
| Crashed shortly after taking off. | 5 | 0.1% |
| Crashed on takeoff. | 4 | 0.1% |
| Crashed under unknown circumstances | 4 | 0.1% |
| Crashed en route. | 4 | 0.1% |
| Shot down by rebel forces. | 4 | 0.1% |
| Other values (4841) | 4887 | 97.7% |
| (Missing) | 59 | 1.2% |
Words
| Value | Count | Frequency (%) |
| the | 18449 | 10.1% |
| of | 5538 | 3.0% |
| a | 5446 | 3.0% |
| and | 5436 | 3.0% |
| to | 5425 | 3.0% |
| in | 3675 | 2.0% |
| crashed | 3386 | 1.9% |
| was | 2773 | 1.5% |
| aircraft | 2556 | 1.4% |
| into | 2356 | 1.3% |
| Other values (11551) | 127814 | 69.9% |
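
A word-frequency table of this kind can be approximated with a simple counter. The sketch below is an assumption about the general approach (lowercasing, whitespace splitting, stripping surrounding punctuation); the profiler's actual tokenisation is not stated in this report and may differ. The sample strings come from the `summary` sample rows, and `tokenize` is a hypothetical helper.

```python
from collections import Counter

# Three `summary` values from the sample rows of this report.
summaries = [
    "Crashed near the Black Sea, cause unknown.",
    "Shot down by British aircraft crashing in flames.",
    "Shot down in flames by the British 39th Home Defence Squadron.",
]


def tokenize(text):
    """Lowercase, split on whitespace, strip surrounding punctuation."""
    words = (w.strip(".,;:'\"()") for w in text.lower().split())
    return [w for w in words if w]


counts = Counter(word for text in summaries for word in tokenize(text))
total = sum(counts.values())

# Print the most frequent words with their share of all words.
for word, count in counts.most_common(5):
    print(f"{word}: {count} ({count / total:.1%})")
```
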
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 179143 | 16.2% |
| e | 104790 | 9.5% |
| t | 81829 | 7.4% |
| a | 79836 | 7.2% |
| n | 68037 | 6.2% |
| i | 65785 | 6.0% |
| r | 63354 | 5.7% |
| o | 62537 | 5.7% |
| h | 42752 | 3.9% |
| s | 39752 | 3.6% |
| Other values (91) | 316441 | 28.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 868368 | 78.6% |
| Space Separator | 179150 | 16.2% |
| Uppercase Letter | 25255 | 2.3% |
| Other Punctuation | 20590 | 1.9% |
| Decimal Number | 8833 | 0.8% |
| Dash Punctuation | 1642 | 0.1% |
| Close Punctuation | 158 | < 0.1% |
| Open Punctuation | 140 | < 0.1% |
| Final Punctuation | 67 | < 0.1% |
| Control | 33 | < 0.1% |
| Other values (4) | 20 | < 0.1% |
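
The category names in this table are Unicode general categories. A minimal sketch of how such a tally can be reproduced with Python's standard `unicodedata` module is shown below; `CATEGORY_NAMES` and `category_counts` are hypothetical names, the mapping covers only a subset of categories, and this is not claimed to be the profiler's implementation.

```python
import unicodedata
from collections import Counter

# Readable names for a few general-category codes (subset only).
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
}


def category_counts(text):
    """Count the characters of `text` per Unicode general category."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    # Fall back to the raw two-letter code for unmapped categories.
    return {CATEGORY_NAMES.get(code, code): n for code, n in codes.items()}


# One of the `summary` values from the first rows of the dataset.
sample = "Crashed in a storm."
result = category_counts(sample)
print(result)
```
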
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 104790 | 12.1% |
| t | 81829 | 9.4% |
| a | 79836 | 9.2% |
| n | 68037 | 7.8% |
| i | 65785 | 7.6% |
| r | 63354 | 7.3% |
| o | 62537 | 7.2% |
| h | 42752 | 4.9% |
| s | 39752 | 4.6% |
| d | 38361 | 4.4% |
| Other values (30) | 221335 | 25.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 5791 | 22.9% |
| C | 2773 | 11.0% |
| A | 2576 | 10.2% |
| S | 1527 | 6.0% |
| F | 1284 | 5.1% |
| M | 1206 | 4.8% |
| I | 1062 | 4.2% |
| P | 960 | 3.8% |
| W | 922 | 3.7% |
| N | 860 | 3.4% |
| Other values (16) | 6294 | 24.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13466 | 65.4% |
| , | 5709 | 27.7% |
| ' | 770 | 3.7% |
| " | 362 | 1.8% |
| / | 170 | 0.8% |
| : | 56 | 0.3% |
| ; | 34 | 0.2% |
| & | 17 | 0.1% |
| % | 3 | < 0.1% |
| # | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2661 | 30.1% |
| 1 | 1365 | 15.5% |
| 2 | 1040 | 11.8% |
| 5 | 827 | 9.4% |
| 3 | 819 | 9.3% |
| 4 | 577 | 6.5% |
| 6 | 431 | 4.9% |
| 7 | 415 | 4.7% |
| 8 | 386 | 4.4% |
| 9 | 312 | 3.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 179143 | > 99.9% |
| Â | 7 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 157 | 99.4% |
| ] | 1 | 0.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 139 | 99.3% |
| [ | 1 | 0.7% |
Control
| Value | Count | Frequency (%) |
| (control character) | 32 | 97.0% |
| (control character) | 1 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1642 | 100.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 67 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 7 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 7 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 3 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 893623 | 80.9% |
| Common | 210633 | 19.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 104790 | 11.7% |
| t | 81829 | 9.2% |
| a | 79836 | 8.9% |
| n | 68037 | 7.6% |
| i | 65785 | 7.4% |
| r | 63354 | 7.1% |
| o | 62537 | 7.0% |
| h | 42752 | 4.8% |
| s | 39752 | 4.4% |
| d | 38361 | 4.3% |
| Other values (56) | 246590 | 27.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 179143 | 85.0% |
| . | 13466 | 6.4% |
| , | 5709 | 2.7% |
| 0 | 2661 | 1.3% |
| - | 1642 | 0.8% |
| 1 | 1365 | 0.6% |
| 2 | 1040 | 0.5% |
| 5 | 827 | 0.4% |
| 3 | 819 | 0.4% |
| ' | 770 | 0.4% |
| Other values (25) | 3191 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1104114 | > 99.9% |
| None | 72 | < 0.1% |
| Punctuation | 70 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 179143 | 16.2% |
| e | 104790 | 9.5% |
| t | 81829 | 7.4% |
| a | 79836 | 7.2% |
| n | 68037 | 6.2% |
| i | 65785 | 6.0% |
| r | 63354 | 5.7% |
| o | 62537 | 5.7% |
| h | 42752 | 3.9% |
| s | 39752 | 3.6% |
| Other values (73) | 316299 | 28.6% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 67 | 95.7% |
| ‘ | 3 | 4.3% |
None
| Value | Count | Frequency (%) |
| é | 20 | 27.8% |
| á | 15 | 20.8% |
| Ã | 8 | 11.1% |
| Â | 7 | 9.7% |
| ó | 3 | 4.2% |
| ° | 3 | 4.2% |
| ö | 3 | 4.2% |
| ü | 2 | 2.8% |
| ð | 2 | 2.8% |
| ã | 2 | 2.8% |
| Other values (6) | 7 | 9.7% |
year
Real number (ℝ)
| Distinct | 107 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1970.923 |
| Minimum | 1915 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 1915 |
|---|---|
| 5-th percentile | 1931 |
| Q1 | 1951 |
| median | 1970 |
| Q3 | 1992 |
| 95-th percentile | 2010 |
| Maximum | 2021 |
| Range | 106 |
| Interquartile range (IQR) | 41 |
Descriptive statistics
| Standard deviation | 24.632185 |
|---|---|
| Coefficient of variation (CV) | 0.012497791 |
| Kurtosis | -0.96532199 |
| Mean | 1970.923 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.024044307 |
| Sum | 9858557 |
| Variance | 606.74452 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1946 | 88 | 1.8% |
| 1989 | 83 | 1.7% |
| 1947 | 82 | 1.6% |
| 1948 | 78 | 1.6% |
| 1962 | 78 | 1.6% |
| 1972 | 77 | 1.5% |
| 1945 | 75 | 1.5% |
| 1951 | 75 | 1.5% |
| 1994 | 74 | 1.5% |
| 1970 | 73 | 1.5% |
| Other values (97) | 4219 | 84.3% |
| Value | Count | Frequency (%) |
| 1915 | 2 | < 0.1% |
| 1916 | 5 | 0.1% |
| 1917 | 7 | 0.1% |
| 1918 | 4 | 0.1% |
| 1919 | 9 | 0.2% |
| 1920 | 18 | 0.4% |
| 1921 | 12 | 0.2% |
| 1922 | 13 | 0.3% |
| 1923 | 13 | 0.3% |
| 1924 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 2021 | 7 | 0.1% |
| 2020 | 8 | 0.2% |
| 2019 | 13 | 0.3% |
| 2018 | 19 | 0.4% |
| 2017 | 15 | 0.3% |
| 2016 | 23 | 0.5% |
| 2015 | 18 | 0.4% |
| 2014 | 23 | 0.5% |
| 2013 | 25 | 0.5% |
| 2012 | 26 | 0.5% |
| | all_aboard | PASAJEROS A BORDO | crew_aboard | cantidad de fallecidos | passenger_fatalities | crew_fatalities | ground | year |
|---|---|---|---|---|---|---|---|---|
| all_aboard | 1.000 | 0.966 | 0.667 | 0.745 | 0.781 | 0.368 | 0.038 | 0.171 |
| PASAJEROS A BORDO | 0.966 | 1.000 | 0.503 | 0.708 | 0.819 | 0.232 | 0.020 | 0.165 |
| crew_aboard | 0.667 | 0.503 | 1.000 | 0.521 | 0.379 | 0.689 | 0.099 | 0.109 |
| cantidad de fallecidos | 0.745 | 0.708 | 0.521 | 1.000 | 0.940 | 0.680 | -0.007 | 0.110 |
| passenger_fatalities | 0.781 | 0.819 | 0.379 | 0.940 | 1.000 | 0.457 | -0.025 | 0.109 |
| crew_fatalities | 0.368 | 0.232 | 0.689 | 0.680 | 0.457 | 1.000 | 0.041 | 0.041 |
| ground | 0.038 | 0.020 | 0.099 | -0.007 | -0.025 | 0.041 | 1.000 | 0.057 |
| year | 0.171 | 0.165 | 0.109 | 0.110 | 0.109 | 0.041 | 0.057 | 1.000 |
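
The report does not state which correlation coefficient the matrix above uses. As a hedged illustration of how a single cell of such a matrix can be computed, the sketch below implements the Pearson coefficient by hand (an assumption, the profiler may use a different measure) and applies it to the `all_aboard` and `PASAJEROS A BORDO` values of the last ten rows listed in this report. Ten rows are far too few to reproduce the 0.966 in the matrix; the point is only the mechanics. The function and variable names are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Values from the last ten rows of the dataset (indices 4998 to 5007).
all_aboard = [190, 8, 27, 62, 10, 6, 11, 14, 96, 28]
passengers = [184, 5, 20, 56, 8, 5, 7, 12, 88, 22]

r = pearson(all_aboard, passengers)
print(f"r = {r:.3f}")
```
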
| | fecha | Ruta | OperadOR | route | ac_type | all_aboard | PASAJEROS A BORDO | crew_aboard | cantidad de fallecidos | passenger_fatalities | crew_fatalities | ground | summary | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6 | 1915-03-05 | Tienen, Belgium | Military - German Navy | NaN | Zeppelin L-8 (airship) | 41.0 | 0.0 | 41.0 | 17.0 | 0.0 | 17.0 | 0.0 | Crashed into trees while attempting to land after being shot down by British and French aircraft. | 1915 |
| 7 | 1915-09-03 | Off Cuxhaven, Germany | Military - German Navy | NaN | Zeppelin L-10 (airship) | 19.0 | NaN | NaN | 19.0 | NaN | NaN | 0.0 | Exploded and burned near Neuwerk Island, when hydrogen gas, being vented, was ignited by lightning. | 1915 |
| 8 | 1916-07-28 | Near Jambol, Bulgeria | Military - German Army | NaN | Schutte-Lanz S-L-10 (airship) | 20.0 | NaN | NaN | 20.0 | NaN | NaN | 0.0 | Crashed near the Black Sea, cause unknown. | 1916 |
| 9 | 1916-09-24 | Billericay, England | Military - German Navy | NaN | Zeppelin L-32 (airship) | 22.0 | NaN | NaN | 22.0 | NaN | NaN | 0.0 | Shot down by British aircraft crashing in flames. | 1916 |
| 10 | 1916-10-01 | Potters Bar, England | Military - German Navy | NaN | Zeppelin L-31 (airship) | 19.0 | 0.0 | 19.0 | 19.0 | 0.0 | 19.0 | 0.0 | Shot down in flames by the British 39th Home Defence Squadron. | 1916 |
| 11 | 1916-11-21 | Mainz, Germany | Military - German Army | NaN | Super Zeppelin (airship) | 28.0 | NaN | NaN | 27.0 | NaN | NaN | 0.0 | Crashed in a storm. | 1916 |
| 12 | 1916-11-28 | Off West Hartlepool, England | Military - German Navy | NaN | Zeppelin L-34 (airship) | 20.0 | NaN | NaN | 20.0 | NaN | NaN | 0.0 | Shot down by British anti-aircraft fire and aircraft and crashed into the North Sea. | 1916 |
| 13 | 1917-03-04 | Near Gent, Belgium | Military - German Army | NaN | Airship | 20.0 | NaN | NaN | 20.0 | NaN | NaN | 0.0 | Caught fire and crashed. | 1917 |
| 14 | 1917-03-30 | Off Northern Germany | Military - German Navy | NaN | Schutte-Lanz S-L-9 (airship) | 23.0 | NaN | NaN | 23.0 | NaN | NaN | 0.0 | Struck by lightning and crashed into the Baltic Sea. | 1917 |
| 15 | 1917-05-14 | Near Texel Island, North Sea | Military - German Navy | NaN | Zeppelin L-22 (airship) | 21.0 | NaN | NaN | 21.0 | NaN | NaN | 0.0 | Crashed into the sea from an altitude of 3,000 ft. after being hit by British aircraft fire. | 1917 |
| | fecha | Ruta | OperadOR | route | ac_type | all_aboard | PASAJEROS A BORDO | crew_aboard | cantidad de fallecidos | passenger_fatalities | crew_fatalities | ground | summary | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4998 | 2020-08-07 | Calicut, India | Air India Exppress | Dubai - Calicut | Boeing 737-8HG | 190.0 | 184.0 | 6.0 | 20.0 | 18.0 | 2.0 | 0.0 | The flight IX344 suffered a runway excursion while landing at Kozhikode-Calicut Airport in heavy rain. The nose section separated from the fuselage after going down a steep slope at the end of the runway. The pilot and copilot were among the dead. Low visibility, wet runway, low cloud base and poor braking action possibly contributed to the accident. | 2020 |
| 4999 | 2020-08-22 | Juba, South Sudan | South West Aviaiton | Juba - Wau | Antonov 26B | 8.0 | 5.0 | 3.0 | 7.0 | 4.0 | 3.0 | 0.0 | The cargo plane lost height shortly after departure from Juba Airport and impacted a farm near Hai Referendum about 3nm southwest of the airport. One passenger survived in critical condition. The plane was chartered by the World Food Program to transport supplies and wages to Wau and Aweil. | 2020 |
| 5000 | 2020-09-25 | Near Chuguev, Ukraine | Military - Ukraine Air Force | Training | Antonov An26SH | 27.0 | 20.0 | 7.0 | 26.0 | 19.0 | 7.0 | 0.0 | The military transport, crashed 1.2 miles from Chuguev air base. The plane was carrying cadets from a nearby air force university on a training flight. The crew may have reported failure of an engine prior to the accident. | 2020 |
| 5001 | 2021-01-09 | Near Jakarta, Indonesia | Sriwijaya Air | Jakarta - Pontianak | Boeing 737-524 | 62.0 | 56.0 | 6.0 | 62.0 | 56.0 | 6.0 | 0.0 | Sriwijaya Air flight 182 was climbing through 10,900 ft., 11 nm north of Jakarta-Soekarno-Hatta International Airport, over the Java Sea when radar and radio contact was lost. The aircraft then lost height rapidly and impacted the Java Sea. Debris was located near Lancang Island. | 2021 |
| 5002 | 2021-03-02 | Pieri, Sudan | South Sudan Supreme Airlines | Pieri - Yuai | Let L-410UVP-E | 10.0 | 8.0 | 2.0 | 10.0 | 8.0 | 2.0 | 0.0 | One of the engines on the aircraft failed 10 minutes after takeof. When the plane turned back, the second engine failed. | 2021 |
| 5003 | 2021-03-28 | Near Butte, Alaska | Soloy Helicopters | Sightseeing Charter | Eurocopter AS350B3Â Ecureuil | 6.0 | 5.0 | 1.0 | 5.0 | 4.0 | 1.0 | 0.0 | The sightseeing helicopter crashed after missing the top of a 6,000 ft mountain by just 10 - 15 ft. The crash site was near Knik glacier. The pilot, and four others were killed including Czech billionaire Petr Kellner. | 2021 |
| 5004 | 2021-05-21 | Near Kaduna, Nigeria | Military - Nigerian Air Force | NaN | Beechcraft B300 King Air 350i | 11.0 | 7.0 | 4.0 | 11.0 | 7.0 | 4.0 | 0.0 | While on final approach, in poor weather conditions, the aircraft crashed and burst into flames less than 10 km from Kaduna Airport. All 11 occupants were killed, incuding General Ibrahim Attahiru, Chief of Staff of the Nigerian Army. | 2021 |
| 5005 | 2021-06-10 | Near Pyin Oo Lwin, Myanmar | Military - Myanmar Air Force | Naypyidaw - Anisakan | Beechcraft 1900D | 14.0 | 12.0 | 2.0 | 12.0 | 11.0 | 1.0 | 0.0 | The plane was carrying military personnel and monks when it crashed about 300 meters from a steel plant in the Mandalay region. The plane was attempting to land in poor weather conditions and broke into three pieces. | 2021 |
| 5006 | 2021-07-04 | Patikul, Sulu, Philippines | Military - Philippine Air Force | Cagayan de Oro-Lumbia - Jolo | Lockheed C-130H Hercules | 96.0 | 88.0 | 8.0 | 50.0 | NaN | NaN | 3.0 | While attempting to land at Jolo Airport, the military transport overran the runway, struck two houses and burst into flames coming to rest on a coconut plantation. | 2021 |
| 5007 | 2021-07-06 | Palana, Russia | Kamchatka Aviation Enterprise | Petropavlovsk - Palana | Antonov An 26B-100 | 28.0 | 22.0 | 6.0 | 28.0 | 22.0 | 6.0 | 0.0 | The passenger plane crashed into the top of a cliff while attempting to land in inclement weather. The debris fell into the sea. Contact was lost with the plane 10 minutes before it was to land. | 2021 |